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Abstract 

We investigate the problem of computing a minimum set of solutions that approximates within a 
specified accuracy e the Pareto curve of a multiobjective optimization problem. We show that for a broad 
class of bi-objective problems (containing many important widely studied problems such as shortest 
paths, spanning tree, and many others), we can compute in polynomial time an e-Pareto set that contains 
at most twice as many solutions as the minimum such set. Furthermore we show that the factor of 2 is 
tight for these problems, i.e., it is NP-hard to do better. We present upper and lower bounds for three or 
more objectives, as well as for the dual problem of computing a specified number k of solutions which 
provide a good approximation to the Pareto curve. 
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1 Introduction 



In many decision making situations it is typically the case that more than one criteria come into play. For 
example, when purchasing a product (car, tv, etc.) we care about its cost, quality, etc. When choosing a route 
we may care about the time it takes, the distance travelled, etc. When designing a network we may care 
about its cost, its capacity (the load it can carry), its coverage. This type of multicriteria or multiobjective 
problems arise across many diverse disciplines, in engineering, in economics and business, healthcare, and 
others. The area of multiobjective optimization has been (and continues to be) extensively investigated in 
the management science and optimization communities with many papers, conferences and books (see e.g. 
lOniEhHlEGllFGEllMitli y 

In multiobjective problems there is typically no uniformly best solution in all objectives, but rather a 
trade-off between the different objectives. This is captured by the trade-off or Pareto curve, the set of all 
solutions whose vector of objective values is not dominated by any other solution. The trade-off curve 
represents the range of reasonable "optimal" choices in the design space; they are precisely the optimal 
solutions for all possible global "utility" functions that depend monotonically on the different objectives. 
A decision maker, presented with the trade-off curve, can select a solution that corresponds best to his/her 
preferences; of course different users generally may have different preferences and select different solutions. 
The problem is that the trade-off curve has typically exponential size (for discrete problems) or is infinite 
(for continuous problems), and hence we cannot construct the full curve. Thus, we have to contend with an 
approximation of the curve: We want to compute efficiently and present to the decision makers a small set 
of solutions (as small as possible) that represents as well as possible the whole range of choices, i.e. that 
provides a good approximation to the Pareto curve. Indeed this is the underlying goal in much of the re- 
search in the multiobjective area, with many heuristics proposed, usually however without any performance 
guarantees or complexity analysis as we do in theoretical computer science. 

In recent years we initiated a systematic investigation BPY11 IVY] to develop the theory of multiob- 
jective approximation along similar rigorous lines as the approximation of single objective problems. The 
approximation to the Pareto curve is captured by the concept of an e-Pareto set, a set P e of solutions that 
approximately dominates every other solution; that is, for every solution s, the set P e contains a solution s' 
that is within a factor 1 + e of s, or better, in all the objectives. (As usual in approximation, it is assumed that 
all objective functions take positive values.) Such an approximation was studied before for certain problems, 
e.g. multiobjective shortest paths, for which Hansen BHanl and Warburton BWall showed how to construct 
an e-Pareto set in polynomial time (for fixed number of objectives). Note that typically in most real-life 
multiobjective problems the number of objectives is small. In fact, the great majority of the multiobjective 
literature concerns the case of two objectives. 

Consider a multiobjective problem with d objectives, for example shortest path with cost and time 
objectives. For a given instance, and error tolerance e, we would like to compute a smallest set of solutions 
that form an e-Pareto set. Can we do it in polynomial time? If not, how well can we approximate the smallest 
e-Pareto set? Note that an e-Pareto set is not unique: in general there are many such sets, some of which 
can be very small and some very large. First, to have any hope we must ensure that there exists at least a 
polynomial size e-Pareto set. Indeed, in MPY11 it was shown that this is the case for every multiobjective 
problem with a fixed number of polynomially computable objectives. Second we must be able to construct 
at least one such set in polynomial time. This is not always possible. A necessary and sufficient condition 
for polynomial computability for all e > is the existence of a polynomial algorithm for the following 
Gap problem: Given a vector of values b, either compute a solution that dominates b, or determine that no 
solution dominates b by at least a factor 1 + e (in all the objectives). Many multiobjective problems were 
shown to have such a routine for the Gap problem (and many others have been shown subsequently). 

Construction of a polynomial-size approximate Pareto set is useful, but not good enough in itself: For 
example, if we plan a trip, we want to examine just a few possible routes, not a polynomial number in the 
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size of the map. More generally, in typical multicriteria situations, the selected representative solutions are 
investigated more thoroughly by the decision maker (designer, physician, corporation, etc.) to assess the 
different choices and pick the most preferable one, based possibly on additional factors that are perhaps 
not formalized or not even quantifiable. We thus want to select as small a set as possible that achieves a 
desired approximation. In BVYH the problem of constructing a minimum e-Pareto set was raised formally 
and investigated in a general framework. It was shown that for all bi-objective problems with a polynomial- 
time Gap routine, one can construct an e-Pareto set that contains at most 3 times the number of points of the 
smallest such set; furthermore, the factor 3 is best possible in the sense that for some problems it is NP-hard 
to do better. Further results were shown for 3 and more objectives, and for other related questions. Note that 
although the factor 3 of BVYB is best possible in general for two objectives, one may be able to do better for 
specific problems. 

We show in this paper, that for an important class of bi-objective problems (containing many widely 
studied natural ones such as shortest paths, spanning tree, knapsack, scheduling problems and others) we 
can obtain a 2-approximation, and furthermore the factor of 2 is tight for them, i.e., it is NP-hard to do 
better. Our algorithm is a general algorithm that relies on a routine for a stronger version of the Gap 
problem, namely a routine that solves approximately the following Restricted problem: Given a (hard) 
bound b\ for one objective, compute a solution that optimizes approximately the second objective subject to 
the bound. Many problems (e.g. shortest paths, etc.) have a polynomial time approximation scheme for the 
Restricted problem. For all such problems, a 2-approximation to the minimum e-Pareto set can be computed 
in polynomial time. Furthermore, the number of calls to the Restricted routine (and an associated equivalent 
dual routine) is linear in the size OPT e of the optimal e-Pareto set. 

The bi-objective shortest path problem is probably the most well-studied multiobjective problem. It is 
the paradigmatic problem for dynamic programming (thus can express a variety of problems), and arises 
itself directly in many contexts. One area is network routing with various QoS criteria (see e.g. BCX2I lESZl 
IGR+llVVll ). For example, an interesting proposal in a recent paper by Van Mieghen and Vandenberghe BVVB 
is to have the network operator advertise a portfolio of offered QoS solutions for their network (a trade-off 
curve), and then users can select the solutions that best fit their applications. Obviously, the portfolio cannot 
include every single possible route, and it would make sense to select carefully an "optimal" set of solutions 
that cover well the whole range. Other applications include the transportation of hazardous materials (to 
minimize risk of accident, and population exposure) IEVH . and many others; we refer to the references, 
e.g. [EG] contains pointers to the extensive literature on shortest paths, spanning trees, knapsack, and the 
other problems. Our algorithm applies not only to the above standard combinatorial problems, but more 
generally to any bi-objective problem for which we have available a routine for the Restricted problem; the 
objective functions and the routine itself could be complex pieces of software without a simple mathematical 
expression. 

After giving the basic definitions and background in Section|2l we present in Section[3]our general lower 
and upper bound results for bi-objective problems, as well as applications to specific problems. In Section|4] 
we present some results for d = 3 and more objectives. Here we assume only a Gap routine; i.e. these results 
apply to all problems with a polynomial time constructible e-Pareto set. It was shown in BVYB that for d = 3 
it is in general impossible to get a constant factor approximation to the optimal e-Pareto set, but one has to 
relax e. Combining results from BVYH and BKPB we show that for any e' > e we can construct an e'-Pareto 
set of size cOPT e , i.e. within a (large) constant factor c of the size OPT e of the optimal e-Pareto set. For 
general d, the problem can be reduced to a Set Cover problem whose VC dimension and codimension are at 
most d, and we can construct an e'-Pareto set of size 0(dlog OPT e )OPT e . 

We discuss also the Dual problem: For a specified number k of points, find k points that provide the 
best approximation to the Pareto curve, i.e. that form an e-Pareto set with the minimum possible e. In BVYH 
it was shown that for d = 2 objectives the problem is NP-hard, but we can approximate arbitrarily well (i.e. 
there is a PTAS) the minimum approximation ratio p* = 1 + e*. As we'll see, for d = 3 this is not possible, 
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in fact one cannot get any multiplicative approximation (unless P=NP). We use a relationship of the Dual 
problem to the asymmetric fc-center problem and techniques from the latter problem to show that the Dual 
problem can be approximated (for d = 3) within a constant power, i.e. we can compute k points that cover 
every point on the Pareto curve within a factor p' = (p*) c or better in all objectives, for some constant c. (It 
follows from our results that c < 9.) For small p* , i.e. when there is a set of k points that provides a good 
approximation to the Pareto curve, constant factor and constant power are related, but in general of course 
they are not. 

2 Definitions and Background 

A multiobjective optimization problem IT has a set 2n of valid instances, every instance I £ In has a 
set of solutions S(I). There are d objective functions, fi, ■ ■ ■ , fd, each of which maps every instance I 
and solution s G 5(7) to a value fj(I,s). The problem specifies for each objective whether it is to be 
maximized or minimized. We assume as usual in approximation that the objective functions have positive 
rational values, and that they are polynomial-time computable. We use m to denote the maximum number 
of bits in numerator and denominator of the objective function values. 

We say that a ci-vector u dominates another d-vector v if it is at least as good in all the objectives, i.e. 
Uj > Vj if fj is to be maximized (uj < Vj if fj is to be minimized). Similarly, we define domination 
between any solutions according to the (i-vectors of their objective values. Given an instance I, the Pareto 
set P(I) is the set of undominated <i-vectors of values of the solutions in S(I). Note that for any instance, 
the Pareto set is unique. (As usual we are also interested in solutions that realize these values, but we will 
often blur the distinction and refer to the Pareto set also as a set of solutions that achieve these values. If 
there is more than one undominated solution with the same objective values, P(I) contains one of them.) 

We say that a d- vector u c-covers another d- vector v if u is at least as good as v up to a factor of c in all 
the objectives, i.e. Uj > Vj/cif fj is to be maximized (uj < cvj if fj is to be minimized). Given an instance 
I and e > 0, an e-Pareto set P € (I) is a set of d- vectors of values of solutions that (1 + e)-cover all vectors 
in P(I). For a given instance, there may exist many e-Pareto sets, and they may have very different sizes. It 
is shown in MPY11 that for every multiobjective optimization problem in the aforementioned framework, for 
every instance I and e > 0, there exists an e-Pareto set of size 0((4m/e) d_1 ), i.e. polynomial for fixed d. 

An approximate Pareto set always exists, but it may not be constructible in polynomial time. We say that 
a multiobjective problem II has a polynomial time approximation scheme (respectively a fully polynomial 
time approximation scheme) if there is an algorithm, which, given instance I and a rational number e > 0, 
constructs an e-Pareto set P e (I) in time polynomial in the size |/| of the instance I (respectively, in time 
polynomial in |/|, the representation size |e| of e, and in 1/e). Let MPTAS (resp. MFPTAS) denote the class 
of multiobjective problems that have a polynomial time (respectively fully polynomial time) approximation 
scheme. There is a simple necessary and sufficient condition BPY1L which relates the efficient computability 
of an e-Pareto set for a multi-objective problem IT to the following GAP Problem: given an instance I of 
IT, a (positive rational) d-vector b, and a rational 5 > 0, either return a solution whose vector dominates b 
or report that there does not exist any solution whose vector is better than b by at least a (1 + 6) factor in 
all of the coordinates. As shown in HPYli a problem is in MPTAS (resp. MFPTAS) if and only if there is 
a subroutine GAP that solves the GAP problem for II in time polynomial in |/| and |6| (resp. in |/|, |6|, 1 5\ 
and 1/5). 

We say that an algorithm that uses a routine as a black box to access the solutions of the multiobjective 
problem is generic, as it is not geared to a particular problem, but applies to all of the problems for which the 
particular routine is available. All that such an algorithm needs to know about the input instance is bounds 
on the minimum and maximum possible values of the objective functions. (For example, if the objective 
functions are positive rational numbers whose numerators and denominators have at most m bits, then an 
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obvious lower bound on the objective values is 2~ m and an obvious upper bound is 2 m ; however, for specific 
problems better bounds may be available.) Based on the bounds, the algorithm calls the given routine for 
certain values of its parameters, and uses the returned results to compute an approximate Pareto set. 

For a given instance, there may exist many e-Pareto sets, and they may have very different sizes. We want 
to compute one with the smallest possible size, which we'll denote OPT e . HVY1 gives generic algorithms 
that compute small e-Pareto sets and are applicable to all multiobjective problems in M(F)PTAS, i.e. all 
problems possessing a (fully) polynomial GAP routine. They consider the following "dual" problems: 
Given an instance and an e > 0, construct an e-Pareto set of as small size as possible. And dually, given 
a bound k, compute an e-Pareto set with at most k points that has as small an e value as possible. In the 
case of two objectives, they give an algorithm that computes an e-Pareto set of size at most 30PT e ; they 
show that no algorithm can be better than 3-approximate in this setting. For the dual problem, they show 
that the optimal value of the ratio p = 1 + e can be approximated arbitrarily closely. For three objectives, 
they show that no algorithm can be c-approximate for any constant c, unless it is allowed to use a larger e 
value. They also give an algorithm that constructs an e'-Pareto set of cardinality at most 40PT o for any 
e'> (l + e) 2 -l. 

In a general multiobjective problem we may have both minimization and maximization objectives. In 
the remainder, we will assume for convenience that all objectives are minimization objectives; this is without 
loss of generality, since we can simply take the reciprocals of maximization objectives. 

Notation: For a positive integer n G IN*, we will denote by [n] the set {1, 2, . . . , n}. 

3 Two Objectives 

We use the following notation in this section. Consider the plane whose coordinates correspond to the two 
objectives. Every solution is mapped to a point on this plane. We use x and y as the two coordinates of the 
plane. If p is a point, we use x(p), y(p) to denote its coordinates; that is, p = (x(p),y(p)) . 

We consider the class of bi-objective problems II for which we can approximately minimize one objec- 
tive (say the y-coordinate) subject to a "hard" constraint on the other (the x-coordinate). Our basic primitive 
is a polynomial time (or fully polynomial time) routine for the following Restricted problem (for the y- 
objective): Given an instance / E In, a (positive rational) bound C and a parameter 5 > 0, either return a so- 
lution point s satisfying x (s) < C and y (s) < (1 + 5)-min {y over all solutions s E S(I) having x(s) < C} 
or correctly report that there does not exist any solution s such that x (s) < C. For simplicity, we will drop 
the instance from the notation and use Restrict^ (y, x < C) to denote the solution returned by the corre- 
sponding routine. If the routine does not return a solution, we will say that it returns NO. We say that a 
routine Restrict^ (y, x < C) runs in polynomial time (resp. fully polynomial time) if its running time is 
polynomial in |/| and |C| (resp. |/|, \C\, \5\ and 1/5). The Restricted problem for the x-objective is defined 
analogously. We will also use the Restricted routine with strict inequality bounds; it is easy to see that they 
are polynomially equivalent. 

Note that in general the two objectives could be nonlinear and completely unrelated. Moreover, it is 
possible that a bi-objective problem possesses a (fully) polynomial Restricted routine for the one objective, 
but not for the other. The considered class of bi-objective problems is quite broad and contains many well- 
studied natural ones, most notably the bi-objective shortest path and spanning tree problems (see Section 1331 
for a more detailed list of applications). 

The structure of this section is as follows: In Section |3~T1 we show that, even if the given bi-objective 
problem possesses a fully polynomial Restricted routine for both objectives, no generic algorithm can guar- 
antee an approximation ratio better than 2. (This lower bound applies a fortiori if the Restricted routine 
is available for one objective only.) Furthermore, we show that for two such natural problems, namely, 
the bi-objective shortest path and spanning tree problems, it is NP-hard to do better than 2. In Section 1331 
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we give a matching upper bound: we present an efficient 2-approximation algorithm that applies to all of 
the problems that possess a polynomial Restricted routine for one of the two objectives. In Section [331 we 
discuss some applications. 

3.1 Lower bound 

To prove a lower bound for a generic procedure, we present two Pareto sets which are indistinguishable 
from each other using the Restricted routine as a black box, yet whose smallest e-Pareto sets are of different 
sizes. 

Proposition 3.1. Consider the class of bi-objective problems that possess a fully polynomial Restricted 
routine for both objectives. Then, for any e > 0, there is no polynomial time generic algorithm that approx- 
imates the size of the smallest e-Pareto set P* to a factor better than 2. 



Proof. Fix a rational e > and consider the following set of points: p = {x(p),y(p)), q = \x(p) 



r = (^,y(p)^r),Pg = (x(p) + l,y(p)(l-^)) and Pr = (x(p)(l - y(p) + l), where 



x(p),y(p) > 1 + i (Figured)). Let P = {p,q,r,p q ,p r } and P' = {q,r,p q ,p r } be the feasible (solution) 
sets corresponding to two input instances. Note that p (1 + e)-covers all the points, p q does not (1 + e)-cover 
r (due to the x coordinate) and p r does not (1 + e)-cover q (due to the y coordinate). It is easy to see that 
the smallest e - Pareto set for P consists of only one point (namely point p), while the smallest e - Pareto set 
for P' must include two points. 

We can show that a generic algorithm is guaranteed to tell the difference between P and P' only if 1/6 
is exponential in the size of the input. The argument is very similar to the proof of Theorem 1 in HVYL Let 
x(p) = y(p) = M, where M is an integer value exponential in the size of the input and 1/e. By exploiting 
the fact that, in some cases, our primitive is not uniquely defined, we can argue that a polynomial time 
generic algorithm cannot distinguish between instances P and P'. More specifically, a generic algorithm is 
guaranteed to tell the difference between P and P' only if the tolerance S is inverse exponential in the size 
of the input. 

First, note that both points q and r can be efficiently computed by appropriately using the given rou- 
tine; these two points suffice to (1 + e)-cover the feasible set in both cases. Distinguishing between the 
two instances means determining whether p is part of the solution. Assume that we use the operation 
Restrict^x, y < C), where C £ [y{p),y{p r ))- It is easy to see that this is the only "meaningful" operation 
using this routine as a black box. Then, even if p is part of the solution, by definition, Restrict^ can return p q 
as long as x(p q ) < (1 + 8)x(p) or equivalently 5 > jj. But since we want a polynomial time algorithm, | 
has to be polynomial in lg M ; hence, the latter constraint must hold. By symmetry, the same property holds 
for the Restrict^?/, •) routine. Therefore, using each of these routines as a black box, a polynomial time 
algorithm cannot determine if p is part of the solution, and it is thus forced to take at least two points, even 
when it is presented with the set P. Note that the above configuration can be replicated to show that it is 
impossible for a generic algorithm to determine whether the smallest e-Pareto set has k points or 2k points 
are needed. ■ 

In fact, we can prove something stronger (assuming P ^ NP) for the bi-objective shortest path (BSP) and 
spanning tree (BST) problems. In the BSP problem, we are given a (directed or undirected) graph, positive 
rational "costs" and "delays" for each edge and two specified nodes s and t. The set of feasible solutions is 
the set of s — t paths. The objectives (to be minimized) are linear, i.e. the "total weight" of a path equals the 
sum of the weights of its edges. The BST problem is defined analogously. These problems are well-known 
to possess polynomial Restricted routines for both objectives HLR1 IGRL We show the following: 
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Figure 1 : A polynomial time generic algorithm cannot determine if p is a solution of the given instance. 

Theorem 3.2. a. For the BSP problem, for any kfrom k = 1 to a polynomial, it is NP-hard to distinguish 
the case that the minimum size OPT e of the optimal e-Pareto set is kfrom the case that it is 2k — 1. 
b. The same holds for the BST problem for any fixed k. 

Proof. The reductions are from the Partition problem HGJIk we are given a set A of n positive integers 
A = {ai, a,2, ■ ■ ■ , a n }, and we wish to determine whether it is possible to partition A into two subsets with 
equal sum. 

a. For simplicity, we first prove the theorem for k = 1 and then generalize the construction. Given an 
instance of the Partition problem, we construct an instance of the BSP problem as follows: Let G be a graph 
with n + 1 nodes V{, i G [n + 1] and 2n edges {ej, e'j}, j E [n]. We attach the pair of (parallel) edges 
{ej, e'j} from V{ to t>i+i, i G [n] and set s = v\ and t = v n+ \. We now specify the two cost functions c(-) 
and d(-) on the edges: c(ej) = d(e' i ) = S + 2eajn and d{e{) = c(e-) = S, where S = Y^=i a i- 

Clearly, this simple transformation defines a bijection between subsets of [n] and s — t paths in G; the 
set J C [n] is mapped to the s - t path Vj = Ui e j{ e J u U^j{ e 'J- Srnce c (^./) = nS + 2en(J] ie j a-i) 
and d(Vj) = nS + 2en(^^j a^), each s — t path satisfies the equation c(V) + d(V) = 2(1 + e)^; 
hence, all feasible solutions are undominated. 

Now observe that two solution points suffice to (1 + e)-cover the feasible set; just pick the ("extreme") 
points r = ((1 + 2e)Sn, Sn), I = (Sn, (1 + 2e)Sn), corresponding to the s — t paths V\ n \ = Ur=i{ e «} 
and "Pg = UiLi{ e '*} respectively. Indeed, r (1 + e)-covers all the points having cost (x-coordinate) at least 
(1 + e)<Sn (since (1 + 2e)/(l + e) < 1 + e). Equivalently, it (1 + e)-covers all the solution points having 
delay (y-coordinate) up to (1 + e)Sn (since all the solutions lie on the line segment x + y = 2(1 + e)nS). 
Moreover, the point I (1 + e)-covers all the solution points having y-coordinate at least (1 + e)Sn. 

Since for each feasible solution V it holds mm{c(V), d(V)} > nS (and the "extreme" paths have cost 
or delay equal to nS), it follows that there exists an e-Pareto set containing (exactly) one point if and only if 
there exists a path in G with coordinates ((1 + e)Sn, (1 + e)Sn). It is immediate to verify that such a path 
exists if and only if there is a solution to the original instance of the Partition problem. 

Note that the above part of the proof does not rule out the possibility of an efficient additive approxima- 
tion algorithm, i.e. an algorithm that outputs an e-Pareto set of cardinality at most OPT e + a, where a is 
an absolute constant. We can rule this out as follows: Intuitively, we can think of the Pareto set of G as a 
"cluster". To prove the theorem for k > 1, the goal is to construct an instance of the problem such that the 
corresponding Pareto set consists of k such clusters that are "(1 + e)-far" from each other, i.e. no point in a 



cluster (1 + e)-covers any point in a different cluster. 

For the BSP problem, we can generalize the proof to hold for any k = poly(re, ^27=1 l°s( a «)) an( ^ f° r 
all e > 0. This can be achieved by exploiting the combinatorial structure of the problem; we essentially 
replicate the graph G k times and appropriately scale the weights. 

Formally, consider k (disjoint) copies of the graph G, G 3 = (V 3 ,E 3 ), j G [k], with V 3 ' = (J^l W% } 
and E 3 = U^ =1 {e^ , e'\}. Add a (source) node s, a (sink) node t; for each j add an edge from s to v{ and 
one from v J n+1 to t. That is, construct the graph H = (Vh, Eh) (see Figure |2) with 

k k 

V H = {s,t} U [J V 3 and E H = \J{(s,v{)UE 3 U {v 3 n+1 ,t)} 

3=1 3=1 

Assign zero cost and delay to each edge incident to s or fj] and set: 

(1 + 2e) 2 ^c(e{) = d(e'{)/(l + 2e) 2 ^ 1 ) =S + 2ea in 
(l + 2e) 2 ^c(e'l) = d(el)/(l + 2e) 2 ^ =S 

From the above equations, it follows that for each s — t path V 3 "using" graph G 3 , j G [k], it holds: 

(1 + 2e) 2{j - 1) c(V j ) + d{V j )/(l + 2e) 2(j - 1} = 2(1 + e)nS 

This implies that all feasible solutions are undominated. In particular, the Pareto set for this instance 
is the union of k disjoint "clusters" with endpoints lj = ^ ( 1+2 f)"u-i) > Sn(l + 2e) 2 ^'~ 1 ^ +1 ^ and r,j = 

( (l+Qe^S'-i)-! 1 Sn{l + 2e) 2 ^ -1 ^ , j G [k]. The solution points in each cluster lie on the line segment l 3 Tj. 
(The objective space for this instance is illustrated in Figure [3]) 

Now notice that no solution point corresponding to an s — t path using graph G^ is (1 + e) -covered by 
any point corresponding to an s — t path using graph G l for j ^ I. Indeed, due to the structure of the Pareto 
set, it suffices to check that, for each j G [k — 1], the points lj and r J+ i do not (1 + e)-cover each other. 
This holds by construction: rj+i is a factor of (1 + 2e) to the left and (1 + 2e) above lj. Therefore, any two 
clusters are "(1 + e)-far" from each other. Thus, any e-Pareto set for this instance must contain at least k 
points. 

As in the case of k = 1, for all j G [k], the solution points lj and r_, (1 + e)-cover the (solution 
points in the) jth cluster. Thus, 2k solution points suffice to (1 + e)-cover the feasible set. Also, the jth 
cluster is (1 + e)-covered by one point if and only if there exists an s — t path in H with coordinates 

m j = ( (i+2^-i) ' ( X + e ) 5 ' n ( 1 + 2e) 2(j ~ 1) ). Similarly, this holds if and only if the original Partition 
instance is a Yes instance. So, if there exists a partition of the set A, the smallest e-Pareto set contains 
exactly k points. Otherwise, the smallest such set must contain 2k points. 

To finish the proof, we observe that there exists an e-Pareto set with (at most) 2k — I points if and only 
if there exists an e-Pareto set with exactly k points. Indeed, the former statement holds if and only if some 
cluster is (1 + e)-covered by one point, i.e. if and only if there exists an s — t path in H with coordinates 
rrij for some j G [k] , which in turn holds if and only if the original Partition instance is a Yes instance. The 
latter holds if and only if the smallest e-Pareto set contains exactly k points. 

b. In the BST problem, we are given an undirected graph G = (V, E), positive rational "costs" c(e) and 
"delays" d(e) for each edge e G E. The set of feasible solutions is the set of spanning trees of G; the goal is 
to minimize cost and delay. For k = 1, the proof for the BST problem is identical to the proof for the BSP 
problem. 

^For simplicity, we allow zero weights on the edges, since there does not exist any s — t path with zero total cost or delay. This 
can be easily removed by appropriate perturbation of the weights. 
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Figure 2: Graphs in the reduction of Theorem [3^2] 



We give a construction that works for any fixe d k and for sufficiently small e; in fact e = 0(1/ k) 
suffices. Consider the graph G' obtained from G by adding one node v$ connected to v± with k parallel 
edges gi, i G [k]. Subtract the value S from all the weights of G and set: c(gi) = {2 — (1 + 2e) 2l }Sn, 
d(gi) = (1 + 2e) 2l Sn. (These edges play the role of offsets.) Clearly, as long as (1 + 2e) 2k < 2, all the 
weights are in the interval (0, 2Sn). It is also not hard to see that, under this restriction, the Pareto set of G' 
contains k clusters (having 0(2 n ) undominated points each) that are "(1 + e)-far" from each other - in the 
sense defined above. 

The points of the i-th cluster, i S [k], lie on the line segment c(V)+d(V) = 2(l+e)nS with "endpoints" 
(c(gi),d(gi) + 2eSn) and (c(gi) + 2eSn,d(gi)); the latter (solution) points suffice to (1 + e)-cover the 
corresponding cluster. It is easy to see that there exists a solution with coordinates ((l+e)c(gi), (l+e)d(gi)) 
- i.e. a solution that (l+e)-covers the cluster - if and only if there exists a subset of A with sum (l+2e) 2t S/2. 

To complete the proof, we use the fact that the following variant of the Subset Sum problem is NP-hard: 
Given A = {a±, a2, ... , a n } with the property that (i) either there exist k subsets Ai C A, i G [k], such 
that J2xeAi x = 7*^/2 or &) no sucn subset exists, decide which one of the two cases holds (for any fixed 
integer k and rational 7 > 1 such that j k < 2). (This can be shown by a reduction from the Partition 
problem.) Therefore, it is NP-hard to decide if the smallest e-Pareto set for the instance has k points or 2k 
points are needed. ■ 

Remark 3.3. For k = 1 the theoremark says that it is NP-hard to decide if one point suffices or we need at 
least 2 points for an e-approximation. We proved that the theorem holds also for more general k to rule out 
additive and asymptotic approximations. We can easily modify the proof so that the graphs in the reductions 
are simple. For the BSP problem, this can be achieved by inserting a new ("dummy") node in the "middle" of 
each parallel edge (subdividing the weights arbitrarily). For the BST problem, this does not suffice, because 
all the additional nodes must be covered (by a spanning tree). Let Wi be the node inserted in the middle 
of &i = (vi,Vi + i). The problem is solved by setting c((v{,Wi)) = d((vi,Wi)) = 0, c((wi, Wt+i)) = c(ej) 
and d((wi,Vi + i)) = d(ei). By scaling the weights of the Partition instance we can see that the NP-hardness 
holds even in the case where all the edge weights are restricted to be positive integers. Similar hardness 
results can be shown for several other related problems (see Section |33T ). 
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3.2 Two Objectives Algorithm 

We have a bi-objective problem with an associated Restricted routine for the y-objective that runs in poly- 
nomial (or fully polynomial) time. We are given an instance and an e, and we wish to construct an e-Pareto 
set of as small size as possible. In this subsection, we present a generic algorithm that guarantees ratio 2. 
By the result of the previous subsection, this factor is optimal. Recall that the algorithm in MVYI works for 
all problems in MPTAS and is a factor 3 approximation. (The analysis of the latter algorithm is tight for 
the class of problems considered here.) In Section [3.2. II we show that a straightforward greedy approach 
cannot guarantee a ratio better than 3 in our setting. We next make a crucial observation that is exploited in 
Section l3.2.2l to achieve the optimal factor. 

3.2.1 The Greedy Approach Fails 

We remark that if the underlying problem has polynomial time exact Restricted routines for both objectives 
(i.e. Restrict^ for 5 = 0), then we can efficiently compute the optimal e-Pareto set by a simple greedy 
algorithm. The algorithm is similar to the one given in HKP1 IVYl for the (special) case where all the solution 
points are given explicitly in the input. We denote by x m i n , y m j n the minimum values of the objectives 
in each dimension. The greedy algorithm proceeds by iteratively selecting points q\ , . . . , q^ in decreasing 
x (increasing y) as follows: We start by computing a point q[ having minimum y coordinate among all 
feasible solutions (i.e. y(q[) = y m m)', Qi is then selected to be the leftmost solution point satisfying y{q\) < 
(1 + e)y{q' l ). During the jth iteration (j > 2) we initially compute the point q'j with minimum y-coordinate 
among all solution points s having x(s) < + e) and select as qj the leftmost point which 

satisfies y(qj) < (1 + e)y(q'j). The algorithm terminates when the last point selected (1 + e)-covers the 
leftmost solution point(s) (i.e. the point(s) q having x(q) = £ m in)- It follows by an easy induction that 
the set {gi, (fe, ...,<&} is an e-Pareto set of minimum cardinality. (This exact algorithm is applicable to 
bi-objective linear programming and all problems reducible to it, for example bi-objective flows, the bi- 
objective global min-cut problem BAZII and several scheduling problems MCJKL For these problems we can 
compute an e-Pareto set of minimum cardinality.) 

If we have approximate Restricted routines, one may try to modify the greedy algorithm in a straightfor- 
ward way to take into account the fact that the routines are not exact. However, as shown below, this modified 
greedy algorithm is suboptimal, in particular it does not improve on the factor 3 that can be obtained from 
the general GAP routine. More care is required to achieve a factor 2, matching the lower bound. 

Suppose that we have a (fully) polynomial Restrict^ routine (even for both objectives). Consider the 
following scheme, where 5 is the "uncertainty parameter" - 5 < e, but 1/5 must be polynomially bounded 
in the size of the input and 1/e, so that the overall algorithm runs in polynomial time: 

Algorithm Greedy 

Compute y min and x min . 

Vi = 2/min(l + e); 

qi = Restrict^,?/ < yi); 

Q = {<?i}; i = i; 

While (x min < x( qi )/(l + e)) do 

{ q' i+l = Restrict^, x < x(qi)/(l + e)); 

Vi+i = (1 + e) • max{|/j, y{q' i+l )/ (1 + 5)}; 

q i+ i = Restricts (x,y < y i+ i); 

Q = QU {q i+1 }; 

i = i + 1; } 
Return Q. 
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Figure 4: Illustration of the worst-case performance of the greedy approach. 



Since the Restricted routines are now approximate, in order to guarantee that the output set of points is 
an e-Pareto set, we had to appropriately modify the algorithm based on the parameter 5. More specifically, 
note that the point q' i+l can have y-coordinate up to {1 + 5) times the minimum y over all points s satisfying 
x(s) < x(qi)/(\ + e). In other words, there may exist a solution point I satisfying x(s) < x{qi)/{l + e) 
and y(s) = y(q' i+1 ) /(l + 5). (The algorithm has "no way of knowing this" unless it uses a value of 5 with 
1/5 exponential in the size of the input.) This "uncertainty" forces the algorithm to select as point qi + \ 
the leftmost point that satisfies y(qi+i) < (1 + e)y{q' i+1 ) / {1 + 5). Due to this "weakness", we have the 
following: 

Claim 3.4. For any 5 > 0, with 1/5 polynomial in the size of the input and 1/e, there exist instances on 
which the greedy algorithm above outputs a set Q such that \Q\ = 3k — 1, where k = OPT e . 

Proof. Denote by P* = {p\, . . . ,p* k } the optimal set, where its points p*, i G [k] are ordered in decreasing 
order of their x-coordinate, and Q = {qi, ■ ■ ■ ,q r } the set selected by the greedy algorithm. By exploiting 
the uncertainty introduced by the parameter 5, we describe an adversarial scenario such that r = 3k — 1. 

The idea is the following: Consider the subroutine call q' i+1 = Restrict^?/, x < x(qi)/(l + e)). By 
definition, we have y < y(q' i+1 ) < (1 + 5)y, where y = min{y(s) | x(s) < x(qi)/(l + e)}. Suppose 
that the routine returns a point q' i+l satisfying y = y(q' i+1 ). Call this condition (f ). If q' i+1 satisfies this 
condition, the optimal point p* (1 + e)-covering q' i+1 can have y-coordinate up to (1 + e)y(q' i+1 ), while the 
algorithm is forced to select a point qi + \ with y-value at most (1 + e)y(q' i+l )/ (1 + 5). 

We refer the reader to Figure |4]for an illustration. In the instance presented there, the rightmost optimal 
pointp^ (l + e)-covers all the solution points that are (l + e)-covered by the set {qi, (72}, while, for j > 2, the 
optimal point p* (1 + e)-covers all the solution points that are (1 + e)-covered by the set {qsj , q?,j+i, q?,j+2} ■ 
This proves the desired claim. In the following, we explain the situation in detail. 

Consider the first point q\ G Q selected by the algorithm. By the definition of the Restricted routine 
and the fact that q[ must be (1 + e)-covered by p\, it follows that x{p\) > x(qi) /(l + 5). Suppose that the 
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following scenario occurs: x(p\) = x(q\)/(l + 6), x(q\)/[(l + e)(l + 6)] < x{q2) < x(q\)/(l + e) and 
there are no solutions with x-coordinate in the interval [^(52) /(I + e) > X (<12)) ■ Then, the point p* (1 + e)- 
covers all solutions that are (1 + e)-covered by the set {qi, q 2 }. Notice that the algorithm only "loses" one 
additional point here; we have that x(q 2 ) < x(p\). This is due to the fact that we can exactly compute the 
minimum y-coordinate. However, since this does not hold for the next iterations, the algorithm can "lose" 
two additional points for each optimal point. 

Now suppose that the points {p^, q%, q' 3 , q^, q' 4 } satisfy the following scenario: q 3 satisfies condition (t), 
y(q 3 ) = [(1 + e)/(l + 5)}y(q' 3 ), y{q' A ) = (1 + 8)y{q 3 ), x(q A ) = (1 + 5)x(p* 2 ) and y(p* 2 ) = y{q' A ). It is easy 
to see that these conditions are simultaneously realizable. (Observe that p 2 (1 + e) -covers q' 3 .) Finally, if 
x(qi)/[(\ + e){l + 5)\ < x{q§) < x{q^)/{l-\-e) and there are no solutions with ^-coordinates in the interval 
[x(qs)/(l + e),x(q§)), the point p 2 (1 + e)-covers all the solutions (1 + e)-covered by the set {q 3 , q<±, q§}. 

By replicating the above described configuration, it follows inductively that p* +1 (1 + e)-covers all the 
solutions (1 + e)-covered by {q 3 i, qu-\-\, q 3 i +2 }. This completes the proof. ■ 

In fact, one can show that the the greedy algorithm guarantees a factor 3, i.e. the above described 
adversarial scenario represents a worst-case instance for the algorithm. Let us now try to understand why 
the greedy approach fails to guarantee a factor 2 in the aforementioned scenario. The problem is that, due to 
the uncertainty introduced by 6, the point p* 2 can lie arbitrarily to the left of q 3 . Thus, the only invariant that 
the greedy algorithm can guarantee is x {q±) < (1 + S)x(p 2 ). 

We can overcome this obstacle by exploiting an additional structural property of the considered class of 
bi-objective problems. In particular, our generic algorithm will also use a polynomial routine for the follow- 
ing Dual Restricted problem (for the x-objective): Given an instance, a (rational) bound D and 6 > 0, either 
return a solution s satisfying y (s) < (1 + S) D and x (s) < min {x(s) over all solutions s having y(s) < D} 
or correctly report that there does not exist any solution s such that y (s) < D. Similarly, we drop the 
instance from the notation and use DualRestrict^ (x, y < D) to denote the solution returned by the corre- 
sponding routine. If the routine does not return a solution, we will say that it returns NO. We say that the 
corresponding routine runs in polynomial time (resp. fully polynomial time) if its running time is polynomial 
in |/| and \D\ (resp. |7|, \D\, \5\ and 1/5). 

The following lemma establishes the fact that any bi-objective problem that possesses a (fully) polyno- 
mial Restricted routine for the one objective, also possesses a (fully) polynomial Dual Restricted routine for 
the other. 

Lemma 3.5. For any bi-objective optimization problem, the problems Restrict $ (y, •) and DualRe strict s (x, •) 
are polynomially equivalent. 

Proof. The proof of (both directions of) this equivalence uses binary search on the range of values of one 
objective with an application of the polynomial routine (for the other objective) at each step of the search. 
Let m be an upper bound on the number of bits in the objectives; recall that m is polynomially bounded in 
the size of the instance. Observe that (the absolute value of) the minimum possible difference between the 
objective values of any two solutions is at least 2~ 2m . 

First, we argue that a polynomial time algorithm for Restricts (y, x < C) can be used as a black box to 
obtain a polynomial time algorithm for DualRestrict^x, y < D). 

Given an upper bound D and a (rational) error tolerance 5 > 0, the following algorithm computes the 
function DualRestrict^x, y < D): 

1. If Restrict^?/, x < 2 m ) returns a solution so having y(so) > (1 + 5)D or returns "NO", then output 
"NO". 

2. Otherwise, do a binary search on the parameter C in the range [2 _m , 2 m ] calling Restrict^, x < C) 
in each step, until you find a value C such that: 
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(a) Restrict^?/, x < C) returns a solution s satisfying x(s) < C and y(s) < (1 + S)D. 

(b) Restrict^y, x < C — 2~ 2m ) either returns a solution s' having x(s') < C — 2~ 2m and y(s') > 
(1 + 5) Dor returns "NO". 

Output the solution s. 

The number of calls to the routine Restrict^y, x < C) is ©(to), so the overall algorithm runs in 
polynomial time. It remains to argue about the correctness. In case 1, either there are no feasible solutions 
or all solutions have y coordinate strictly greater than D. In case 2, all solutions s having x(s) < C — 2~ 2m 
also satisfy y(s) > D. Since there are no solutions with x coordinate strictly between x(s) and C — 2~ 2m , 
C < min{x over all solution points s having y(s) < D}. 

Conversely, given an upper bound C and a (rational) error tolerance 5 > 0, the following algorithm 
computes the function Restricts (y, x < C) using as a black box an algorithm for DualRestrict^ (x, y < D): 

1. If DualRestrict^x, y < 2 m ) returns a solution sq having x(sq) > C or returns "NO", then output 
"NO". 

2. Otherwise, do a binary search on the parameter D in the range [2~ m , 2 m ] calling DualRestrict^x, y < 
D) in each step, until you find a value D such that: 

(a) DualRestrict5(x, y < D) returns a solution s satisfying x(s) < C and y(s) < (1 + S)D. 

(b) DualRestrict5(x, y < D — 2~ 2m ) either returns a solution s' having x(s') > C (and y(s') < 
(1 + 5)(D- 2~ 2m ) or returns "NO". 

Output the solution s. 

The justification is similar. The number of calls to the routine DualRestrict^x, y < D) is ©(to), so 
the overall running time is polynomial. For the correctness, in case 1, either there are no feasible solutions 
or all solutions have x coordinate strictly greater than C. In case 2, all solutions s having y(s) < D — 2~ 2m 
also satisfy x(s) > C. Since there are no solutions with y coordinate strictly between y(s) and D — 2~ 2m , 
D < mm{x over all solution points s having x(s) < D}. ■ 

3.2.2 Algorithm Description 

We first give a high-level overview of the 2-approximation algorithm. The algorithm iteratively selects a 
set of solution points {q±, . . . , q r } (in decreasing x) by judiciously combining the two routines. The idea 
is, in addition to the Restricted routine (for the y-coordinate), to use the Dual Restricted routine (for the 
x-coordinate) in a way that circumvents the problems previously identified for the greedy algorithm. More 
specifically, after computing the point q\ in essentially the same way as the greedy algorithm, we proceed 
as follows. We select as q, L a point that: (i) has y-coordinate at most (1 + e)y{q' i )/{l + 8) and (ii) has 
x-coordinate at most the minimum x over all solutions s with y(s) < (1 + e)y(q' i )/(l + 6) 2 for a suitable 
5. This can be done by a call to the Dual Restricted routine for the x-objective. Intuitively this selection 
means that we give some "slack" in the y-coordinate to "gain" some slack in the x-coordinate. Also notice 
that, by selecting the point qi in this manner, there may exist solution points with y-values in the interval 
((1 + e)y{q' i ) /(l + 5) 2 , (1 + e)y{q' i ) /(l + 5)] whose x-coordinate is arbitrarily smaller than x{qj). In fact, 
the optimal point (1 + e) -covering q^ can be such a point. However, it turns out that this is sufficient for 
our purposes and, if 5 is chosen appropriately, this scheme can guarantee that the point q2% lies to the left 
(or has the same x-value) of the i-th rightmost point of the optimal solution. We now proceed with the 
formal description of the algorithm. In what follows, the error tolerance is set to 5 = -^1 + e — 1 (ps e/3 
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for small e). (For the case that the Restricted routine is available for both objectives, we have a variant of 
this algorithm that achieves a ratio of 2 and is slightly more efficient in the sense that it uses error tolerance 
5' = \/l + e — 1.) If v^l + e is not rational, we let 5 be a rational that approximates v / T+~e — 1 from below, 
i.e. (1 + <5) 3 < (1 + e), and which has representation size \5\ = O (|e|) (i.e. number of bits in the numerator 
and denominator). The set of points computed by the algorithm is shown in Figure [5] 

Algorithm 2-Approximation 

If Restrict 5o ^i(y,x < 2 m ) = NO then halt. 
q[ = Restrict s (y,x < 2 m ); 

<7Mt = DualRestrict,5 ^i(x, y < 2 m ); x min = x(g left ); 

y 1 = y( q ' 1 )(l + S); 

qi = DualRestrict^x, y < yi); 

x\ = x(qi)/(l + e); 

Q = {<?i}; i = i; 

While (xi > Xmin) do 

{ q' i+1 = Restrict^, x < x*); 

y l+1 = [(1 + e)/(l + S)\ • maxfe, y(q' l+1 )/(l + 5)}; 

q i+ i = DualRestrict 5 (x,y < m+i); 

x i+ i = x(q i+ i)/(l + e); 

Q = Q U {q i+1 }; 

i = i + 1; } 
Return Q. 

3.2.3 Analysis 

Recall that 2 m is an upper bound on the values of the objectives. Thus, if Restrict^^i (y, x < 2 m ) = NO, 
there are no feasible solutions, in which case we can just terminate the algorithm. So, we can assume that the 
solution set is nonempty. In this case, the subroutine calls of lines 2 and 3 indeed return a solution; moreover, 
(/) the solution point gi e ft has minimum x-value among all feasible solutions and (ii) q[ has y-value at most 
(1 + S)y m - m . Now observe that y m j n < y~i < yi+i and Xj > x m i n for all the values of i for which the body 
of the while loop is executed. It is thus easy to see that each subroutine call returns a point; so, all the points 
are well-defined. 

Let Q = {qi, q2, . . . , q r } be the set of solution points produced by the algorithm. We will prove that the 
set Q is an e-Pareto set whose size is at most twice the optimum. We note the following simple properties. 

Fact 3.6. We have the following. ■ 

1. For each i G [r — 1] it holds (i) x(q' i+1 ) < x((/j)/(l + e) and (ii) for each solution point t with 
x(t) < x(g f )/(l + e), we have y(t) > y{q' i+1 )/{l + 5). 

2. For each i G [r] it holds (i) y(qi) < (1 + S)yi and (ii) for each solution point t with y(t) < y~i we have 
x(t) > x(qi). 

Proof. The properties are just restatements of the definition of the two subroutines. ■ 

We can now prove the following lemmata (all properties used below refer to the above fact). 

Lemma 3.7. The x coordinates of the points q±, q2, . . . , q r of Q form a strictly decreasing sequence. 

Proof. Consider two successive elements qi, qi + \ of Q. For their x coordinates we will argue that x(qi + \) < 
x(qi)/(l + e). First observe that y(q' i+l ) < y%+\. So, property 2-(ii) implies that x^+i) < x(q' i+x ). Now 
from property l-(/) we get x(q' i+1 ) < x(qi)/ (1 + e) and the argument is complete. ■ 
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The following lemma shows that Q is indeed an e-Pareto set. 

Lemma 3.8. 1. The point q\ (1 + e)-covers all of the solution points that have x-coordinate at least 
s(gi)/(l + e). 

2. For each i £ [r] \ {1} the point qi (1 + e)-covers all of the solution points that have their x-coordinate in 
the interval [x(qi)/(l + e), x(qi-i)/(l + e)). 

3. There are no solution points with x-coordinate smaller than x(q r ) /(l + e). 

Proof. 1. Let t be a solution point with x(t) > x(q{)/(l + e). We need to show that t is (1 + e)-covered 
by qi. It clearly suffices to argue that y(t) > y(q\)/(\ + e). Indeed, by property 2-(ii) we have y{q\) < 
(l + 5)y\ = {l + 5) 2 y(q'i) and the definition of q[ implies that y(t) > y(q[)/ (1 + 5), for any solution point t. 
By combining these facts we get that for any solution point t it holds y(t) > y(qi)/(l + 5) 3 > y(q\) / '(1 + e). 

2. Let t be a solution point satisfying x(qi)/(l + e) < x(t) < x(qi^i)/(l + e); we will show that t is 
(1 + e)-covered by qi or equivalently that y(t) > y(qi)/(l + e). The proof is by contradiction. Suppose 
that there exists such a point t with y(t) < y(qi)/(l + e). By property 2-(i) and the definition of yi 
this implies y(t) < max{yi^.i,y(q^)/(l + 5)}. Now since x(t) < + e), property l-(n') gives 
y{t) > y{<l'i)/{^ + S). Furthermore, since x(t) < x(qi-i), by property 2-(ii) it follows that y(t) > y~i~\. 
This provides the desired contradiction. 

3. The termination condition of the algorithm is x(q r )/ (1 + e) < x m - m . ■ 

Remark 3.9. We show in Lemma l3. lOl below that the set Q is of cardinality \Q\ < 20PT e . So, the algorithm 
could output this set of points. However, we observe that the set Q may contain "redundant" points: The 
y-coordinates of the points q\, . . . , q r do not necessarily form an increasing sequence. In fact, if y(qi + i) < 
(1 + 5)y~i, it may happen that y(qi + i) < y(q-i) (in which case the point yi is redundant). (Note however that 
if y(qi + i) > (1 + 5)y~i, then by property 2-(i) we get y(qi + i) > y(qi).) This observation can be further 
exploited for a post-processing step. For example, if y(q2i) < (1 + <5)i/2i-i> we can sa fely discard the point 
<72i-i as implied by (the proof of) Lemma l3~8l 

We now bound the size of the set of points Q in terms of the size of the optimal e-Pareto set. 

Lemma 3.10. Let P* = {p\,P2, ■ ■ ■ be the optimal e-Pareto set, where its points p*, i £ [k], are 
ordered in (strictly) increasing order of their y- and (strictly) decreasing order of their x-coordinate. Then, 
\Q\=r< 2k. 

Proof. We prove the following: 

Claim 3.11. If the algorithm selects a solution point q2i-i (i-c. if2i — 1 < r), then there must exist a point 
p* in P* (i.e. it holds i < k) and if the algorithm selects a point q2i, then x(p*) > x(q2i). 

The desired result follows directly from this. The claim is proved by induction on i. 

Basis (i = 1). The first statement of the claim trivially holds. To show the validity of the second 
statement observe that for the rightmost point of P*, we must have y (p\) < y(q[)(l + e) = yi(l + e)/(l + 
5) < V2- The first inequality holds since the solution point q[ must be (1 + e)-covered by P* and in particular 
by the point of P* having the minimum y-coordinate. The two other inequalities follow from the definitions 
of y\ and y~2- Now an application of property 2-(ii) gives x(p\) > x(q2) and the base case is proved. 

Induction step. Suppose that the claim holds for index i— 1 (more specifically that x(p*_ l ) > x(q2i-2)Y, 
we will prove it for i. We will prove each statement in turn. 

Assume first that the algorithm selects a point (fe-i (i-C- that 2i — 1 < r). We will show that P* 
contains a point p* (i.e. that i < k). By the termination condition of the algorithm, our assumption implies 
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that x{q2i-2) > (1 + e)^min- Therefore, by the induction hypothesis it follows that x(p*_ 1 ) > (1 + e)x m i n ; 
that is, point p*_ 1 does not (1 + e)-cover the leftmost solution point, which means there must exist a point 
p* in the optimal set. 

Now assume that the algorithm selects a point qn. We will show that x(p*) > x(q2i). First note that 
by property and the induction hypothesis x(q 2i _ 1 ) < x(p*_ 1 )/(l + e). So, the point p*_ 1 does not 
(1 + e)-cover the point g^i-i i n tne ^-coordinate. Clearly, the latter point must be (1 + e)-covered by a point 
in P*. Since the p*'s are sorted in decreasing order of their x-coordinates, we conclude that p* is the only 
eligible point for that purpose, i.e. g^i-i must be must (1 + e)-covered by p*. To complete the argument, 
we need the following fact: 

Fact 3.12. There does not exist any solution point t with x(t) < x{q2i) such that t (1 + e)-covers point 

Proof. We want to prove that for all solutions t having x(t) < x{q2%) it holds y(t) > (1 + e)y(q' 2i _i)- For 
such a solution point t we have y(t) > y~2i > 272i-i(l + e)/(l + £) > (l + e)y(q' 2i _ l ). The latter inequalities, 
in the order they appear, follow by applying property 2-(ii) and the definition of yj (for j = 2i — 1, 2i). ■ 

The above fact implies directly that x{p*) > x{q2i) and the proof is complete. ■ 

Thus far, we have proved that the set Q is an e-Pareto set of size \Q\ < 20PT e . We now analyze the 
running time of the algorithm. Let k be the number of points in the smallest e-Pareto set, k = OPT e . The 
algorithm involves r < 2k iterations of the while loop; each iteration involves two calls to the subroutines. 
Therefore, the total running time is bounded by Ak subroutine calls. In summary, we proved the following 
theorem. 

Theorem 3.13. The above described algorithm computes a 2-approximation to the smallest e-Pareto set in 
time 0(OPT e ) subroutine calls, where 1/5 = 0(l/e). 

3.3 Applications 

Our result can be applied to all of the problems which have a polynomial (or fully polynomial) time Re- 
stricted routine for one of the two objectives. It should be stressed that our algorithm is quite general; 
it does not assume for example linearity of the objectives. Applications include the shortest path prob- 
lem llHanl IWal lESZl ILRl and generalizations |EV1I(jR+1ICX21IW1 . cost-time trade-offs in query evalua- 
tion MPY2II . spanning trees (and more generally matroid problems, see below) MGRl IHIi and related prob- 
lems MCXi The aforementioned problems possess a polynomial Restricted routine for both objectives. In 
essence, for most of the aforementioned problems (with MPY21 being a notable exception), the two objec- 
tives are "the same" and we can efficiently optimize each of them separately. For several other problems 
HABK1IIABK2IICJK1IDJSSII . the Restricted routine is available for one objective only (because it is NP-hard 
to separately optimize this objective). An example is the following classical scheduling problem: We are 
given a set of n jobs and a fixed number m of machines. Executing job j on machine i requires time pij and 
incurs cost Cij. We are interested in the trade-off between makespan and cost. Minimizing the makespan is 
NP-hard, even for m = 2; hence, the Dual Restricted problem for this objective (equivalently, the Restricted 
problem for the cost objective) does not have a PTAS. If m is fixed, a fully polynomial time Dual Restricted 
routine for the cost objective is given in BABKli (By Lemma [331 this implies an FPTAS for the Restricted 
problem for the makespan objective.) 

For the bi-objective shortest path problem, a polynomial (resp. fully polynomial) Restricted routine 
corresponds to a polynomial (resp. fully polynomial) time approximation scheme for the Restricted Shortest 
Path problem: given a bound on the cost of the path, minimize the delay of the path subject to the bound 
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on the cost. This problem has been studied in a number of papers BHasl IWal ILRI ESZi The problem is 
NP-hard and has a fully polynomial time approximation scheme. The best current algorithms approximate 
the optimal restricted path within factor 1 + e in time 0(en/e) for acyclic (directed) graphs |ESZ|, and 
time 0(en(log log n + 1/e) for general (directed) graphs BLRH . where n is the number of nodes and e 
is the number of edges. Moreover, the Dual Restricted problem also admits an FPTAS with the same 
time complexity. Thus, our algorithm runs in 0(en(loglogn + l/e)OPT e ) time for general graphs and 
0(enOPT e /e) for acyclic graphs. The time complexity is comparable or better than previous algorithms 
BHanHWallTZIl . which furthermore do not provide any guarantees on the size. 

For the bi-objective spanning tree problem a polynomial Restricted routine corresponds to a polynomial 
time approximation scheme for the Constrained Spanning Tree (CST) problem [GR]: given a bound on the 
cost of the tree, minimize the delay of the tree subject to the bound on the cost. This problem is also NP- 
hard and is known to have a polynomial time approximation scheme BGRIIHIi . (In fact, the aforementioned 
papers provide a PTAS for the more general problem of finding a minimum cost base of a matroid subject 
to a bound on the total length, as long as there is a polynomial time independence oracle for the matroid.) 
The best current algorithm for the problem BHLH has running time 0((l/e) 1//<E n 3 ). As a corollary, our 
generic algorithm can compute a 2-approximation to the smallest e-Pareto set in time 0((l/e) 1 / £ n 3 OPT (E ). 
Whether such a 2-approximation can be computed in fully polynomial time is conditional on the existence 
of an FPTAS for the CST problem (which is an interesting open question). In contrast, by the results of 
HPYlllVYll . a 3-approximation can be computed in fully polynomial time. 

4 d Objectives 

The results in this section use the GAP routine and thus apply to all problems in MPTAS. 
4.1 Approximation of the optimal e-Pareto set. 

Recall that for d > 3 objectives we are forced to compute an e'-Pareto set, where e' > e, if we are to have a 
guarantee on its size IIVYII . For any e' > e, a logarithmic approximation for the problem is given in HVY1 . by 
a straightforward reduction to the Set Cover problem. We can sharpen this result, by exploiting additional 
properties of the corresponding set system. 

Theorem 4.1. 1. For any e' > e there exists a polynomial time generic algorithm that computes an e'- 
Pareto set Q such that \Q\ < 0(<ilog OPT e )OPT e . The algorithm uses 0((m/ 5) d ) GAP5 calls, where 
l/S = 0(l/(e'-e)). 

2. For d = 3, the algorithm outputs an e'-Pareto set Q satisfying \Q\ < c0PT o where c is a constant. 

Consider the following problem Q(P, e): Given a set of n points P C as input and e > 0, compute 
the smallest e-Pareto set of P. It should be stressed that, by definition, the set of points P is given explicitly 
in the input. (Note the major difference with our setting: for a typical multiobjective problem there are 
exponentially many solution points and they are not given explicitly.) This problem can be solved in linear 
time for d = 2 by a simple greedy algorithm. For d = 3 it is NP-hard and can be approximated within some 
(large) constant factor c HKPi If d is arbitrary (i.e. part of the input, e.g. d = n), the problem is hard to 
approximate better than within a SI (log n) factor (unless P = NP) IIVYII . 

The following simple fact, implicit in IIVYII . relates the approximability of Q with the problem of com- 
puting a small e'-Pareto set for a multiobjective problem IT, given the GAP primitive. Let e > be a 
given rational number. For any e' > e, we can find a 5 > such that 1/5 = 0(1/ (e' — e)) satisfying 
l + e'> (l + e)(l + cT) 2 . 
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Lemma 4.2. Suppose that there exists an r -factor approximation algorithm for Q. Then, for any e' > e, we 
can compute an e'-Pareto set Q, such that \Q\ < rOPT e using 0((m/5) d ) GAP5 calls. 

Proof. The algorithm proceeds in two phases; in the first phase, we compute a 5-Pareto set, by using the 
original algorithm of HPY11 and in the second phase we post-process the points produced by the latter 
algorithm by using the r-approximation algorithm for Q as a black box. 

For the given instance / G Ijj, let X(I) be the set of d-vectors of values of solutions in the objective 
space and fix an optimal e-Pareto set P* = P*(I). Let R be the 5-Pareto set produced in the first stage. We 
apply the r-approximation algorithm for Q on input R to produce a set R' C R that (1 + e)(l + 5)-covers 
R. (Since \R\ < (m/<5) d-1 , it follows that the overall algorithm runs in polynomial time.) R' is clearly 
an e'-Pareto set for the feasible set X(I). We will argue that \R'\ < rOPT e . Let R* denote the smallest 
(1 + e)(l + 5)-cover for R using only points from R; we have \R'\ < r\R*\. The following simple claim 
completes the argument: 

Claim 4.3. \R*\ < OPT e . 

Proof. It suffices to show that there exists an (1 + e)(l + 5)-cover C for R of cardinality at most OPT e . 
Since R is a <5-Pareto set, for any solution point s G 3L(I), there exists a solution point r G R that (1 + 5)- 
covers s. C is constructed as follows: For each s G P* pick an r G R that (1 + 5)-covers it. Then, 
|C| < \P*\ = OPT e . Every point r G R is (1 + e)-covered by a point s G P*, which in turn is (1 + 5)- 
covered by a point c G C. Therefore, C (1 + e)(l + 5)-covers all points of R. ■ 

■ 

Part 2 of Theorem 14.11 follows immediately from the fact that Q is constant factor approximable for 
d = 3 HKPl and Lemma l4~2l We consider the case of general d in the remainder. To proceed, we need the 
following definition. 

Definition 4.4. A set system is a pair (U, TV), where U is a set and TZ is a collection of subsets of U. For a 
set system (U, TV), we say that X C U is shattered by TZ if for any Y C X, there exists a set R G TZ with 
X n R = Y. The VC-dimension MVCII of the set system is the maximum size of any set shattered by TZ. Let 
T CUbe a finite set and r G (1, 00) be a parameter. A set N C T is called an 1 /r-net for (T, TZ) llHWl . if 
N n S + for all S G TZ having \S\ > \T\/r. 

The problem Q(P, e) can be formulated as a set cover problem as follows: For each point q G P and 
e > 0, define Sq i€ = {x G IR rf | q < (1 + e) • x}. S q:6 is the subset of R d that is (1 + e)-covered by q; it is a 
closed convex cone in M, d (a translation of the nonnegative orthant by the vector qj (1 + e)). For each point 
r G P, r is (1 + e)-covered by q if and only if r G 5 9>e . Now consider the set system F(P, e) = (P, S(P, e)), 
where S(P, e) = {P q , e = P (~) S q>e \ q G P}. Clearly, there is a bijection between set covers of T(P, e) and 
e-Pareto sets of P. We now establish the following: 

Lemma 4.5. a. For any finite set of points P C IR d and e > 0, it holds VC-dim(J r (P, e)) < d. 
b. There exists a set of points P such that VC-dim{J r {P, e)) = d. 

Proof, a. Let P be a set of points in R rf and e > 0. We must argue that no subset P' C P of cardinality 
d + 1 can be shattered by S(P,e). Note that any such set P' C P (of cardinality d + 1) contains a point 
r none of whose coordinates is minimal, that is, a point r such that for all i G [d] there exists some point 
q l G P 1 (different from r) with the property (q l )i < r^. We claim that we cannot "separate" r from the 
remaining points of P' by any convex cone (as defined above). Indeed, a point that (1 + e)-covers the q l 's is 
guaranteed to (1 + e)-cover r (or equivalently, the "dichotomy" {q % , i G [d]} cannot be realized). 

b. Consider a set P = A U C, where \A\ = d and |C| = 2 d . Let A = {ai, . . . , a d }. We select the 
ai s in A as follows: For each % G [<i], the ith coordinate of is equal to 1 and all the rest are equal to 
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1 + 2e. The set A has two properties: (i) no two of its points (1 + e)-cover each other and (ii) for any 
two points p, q G A, we have argmin^ 7^ argmin^j. The set C is selected such that each subset of A is 
(1 + e)-covered by some point in C. In particular, let X = UieJ(x){ a *i ^ e a surj set of A. We add the point 
cx in C having each coordinate indexed by I(X) equal to 1 + e and all the rest equal to 1 + 2e. Clearly, the 
point cx (1 + e) -covers exactly the elements of X. ■ 

For q G P and e > 0, define S® 6 = {x G R d \ x < (1 + e) • q}; the cone Sg e is the subset of R d 
that (1 + e)-covers q. A point r (1 + e)-covers g if and only if r G 5^ e . The "dual" set system of T(P, e) 
is defined as T D (P,e) = (P,S D (P,e)), where S D {P,e) = {P® e = P n | q G P}. In words, the 
elements are the points of P and for each point q G P we have a set consisting of the points r G P that 
(1 + e)-cover g. An e-Pareto set of P is equivalent to a hitting set of T D . 

It is well-known BAsll that, if a set system has VC-dimension at most d, the VC-dimension of the dual set 
system is upper bounded by 2 d+1 — 1. However, in our setting, essentially the same proof as in the previous 
lemma establishes the following: 

Lemma 4.6. For any finite set of points P C H d and e > 0, it holds VC-dim(T D '(P, e)) < d. This bound is 
tight. 

Proof. Let P be a set of points in M, d and e > 0. We must argue that no subset P' C P of cardinality d + 1 
can be shattered by S D (P, e). Similarly to the previous lemma, any set P' C P of cardinality d+ 1 contains 
a point r such that for all i G [d] there exists some point q l G P' (q l / r) satisfying (cf )j > r^. We claim 
that we cannot "separate" r from the remaining points. Indeed, if some point is (1 + e) -covered by all the 
g*'s, then is also (1 + e)-covered by r. The tightness is similar - . ■ 

It is well-known that, for a set system of VC-dimension at most d, we can efficiently construct an 1/r- 
net of size s(r) = 0(dr log r) HKPW1 : this bound is tight in general llPWllKPWIl . As shown in llBGllEPCSl . 
for such a set system, there exists a polynomial time s( OPT) /OPT-f actor approximation algorithm for the 
minimum hitting set problem, where OPT is the cost of the optimal solution. If we apply this result to the 
dual set system P D (P, e) we conclude: 

Proposition 4.7. Problem Q can be approximated within a factor ofO(dlog OPT e ). 
Part 1 of Theorem 14. 1 1 follows by combining Lemma 1431 and Proposition [4/7] 

Remark 4.8. If s(r) = 0(r), the reduction in MBGHERSl implies a polynomial time constant factor approxi- 
mation algorithm for the corresponding hitting set problem. This is exactly the approach in BKPII : they show 
that, for d = 3, J rD (P, e) admits an 1/r-net of size s(r) = 0(r) and that such a net can be efficiently con- 
structed. Note that the constant approximation ratio c implied for set cover using this approach is identified 
with the constant hidden in the big-Oh of the net-size s(r). The corresponding constant in the construction 
of BKPII . itself based on a result of ICVL is quite large and no good bounds have been calculated for it. A 
recent result BPRH implies that the dual set system induced by a finite set of points and translates of an orthant 
in R 3 (a generalization of J rD (P, e)) admits an 1/r-net of size at most 25r (that is efficiently constructible). 
Hence, for d = 3, problem Q can be efficiently approximated within a factor of 25 and the constant c in (the 
second statement of) Theorem 14. II is at most 25. Improving the value of this constant is an interesting open 
problem. 
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4.2 The Dual Problem 



For a (i-objective problem IT with an associated GAP routine, given a parameter k, we want to find k 
solution points that provide the best approximation to the Pareto curve, i.e. such that every Pareto point is 
p*-covered by one of the k selected points for the minimum possible ratio p* = 1 + e*. It was shown in 
iTVYl that for d = 2 the problem is NP-hard but has a PTAS. We show below (Section |4~2TTT ) that for d = 3 
any multiplicative factor for the dual problem is impossible, even for explicitly given points; we can only 
hope for a constant power, and only above a certain constant. 

In BVYII the dual problem was related to the asymmetric /c-center problem, and this was used to show that 
(i) for any d, a set of k points can be computed that approximates the Pareto curve with ratio (p*)°( lo s* k \ 
and (ii) for unbounded d and explicitly given points, it is hard to do much better. Since the metric p for the 
dual problem is a ratio (multiplicative coverage) versus distance (additive coverage) in the A;-center problem, 
in some sense the analogue of constant factor approximation for the Dual problem is constant power. Can 
we achieve a constant power (p*) c for all problems in MPTAS with a fixed number d of objectives? We 
show (Section T4.2.2I ) that the answer is Yes for d = 3 and provide a conjecture that implies it for general d. 

4.2.1 Lower Bound 

Consider the problem V(P, k) (dual problem with explicitly given points): We are given explicitly a set P of 
n points in R+ and a positive integer k and we want to compute a subset of P of cardinality (at most) k that 
p-covers P with minimum ratio p. Let p* = 1 + e* denote the optimal value of the ratio. Note that problems 
Q and V are polynomially equivalent with respect to exact optimization - as opposed to approximation. As 
shown in BKPII . Q is NP-hard for d > 3; hence, for d > 3, problem V is also NP-hard. 

By further exploiting the properties of the aforementioned reduction in [KPj, we can show that problem 
V is NP-hard to approximate. Before we proceed with the formal statement and proof of this fact, it will 
be helpful to give some remarks regarding the notion of "approximate coverage" in the definition of the 
approximate Pareto set. Throughout this paper, our notion of coverage is multiplicative: for p > 1, a point 
u € R+ p-covers a point v G R+ iff u < p ■ v (coordinate-wise). Alternatively, one could define the notion 
of coverage additively: for c > 0, the point u € R+ additively c-covers v E R+ iff Ui < Vi + c for all i. 
A notion of additive c-Pareto set can be naturally defined using the additive coverage. (Note that with the 
additive definition of coverage Pareto sets and approximate Pareto sets are invariant under translation of the 
input set, while with our multiplicative definition they are invariant under scaling.) 

On the one hand, the selection of multiplicative metric is standard and more natural in the context of 
approximation algorithms. On the other hand, it is essential in our setting in the following sense: For a 
(implicitly represented) multiobjective combinatorial optimization problem, the basic existence theorem of 
HPYll (i.e. the fact that there always exists an e-Pareto set of polynomial size) is based crucially on the 
multiplicative coverage. (In fact, it clearly does not hold under the additive coverage. This, of course, 
rules out the possibility of efficient algorithms for computing (any) approximate Pareto set in this context.) 
However, for the case that the set of points is given explicitly in the input (i.e. for problems Q and V) the 
aforementioned obstacle does not occur and one can select the definition of coverage that is more appropriate 
for the specific application. 

We will denote by log Q and log V the primal and dual problems respectively under additive coverage. 
We now try to relate the problem pairs (Q, log Q) and (V, logV) with respect to their approximability. To 
this end, we need a couple of more definitions. For two points p,q G R^ the ratio distance between p and 
q is defined by: lZV(p, q) = max{maxj(pj/gj), 1}. (The ratio distance between p and q is the minimum 
value p* = 1 + e* of the ratio p such that p p-covers q.) The additive distance between p and q is defined by: 
AV(p, q) = max{maxj(pj — qi),0}. (Analogously, the additive distance between p and q is the minimum 
value c* of the distance c such that p additively c-covers q.) It is easy to see that AT>(-, •) is a directed 
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pseudo-metric. 

We claim that the problems Q and log Q are in some sense "equivalent" with respect to approximability. 
Indeed, it is easy to see that an r-approximation algorithm for problem Q implies an r-approximation for 
problem log Q and vice-versa (by taking logarithms and exponentials of the coordinates respectively). Sup- 
pose for example that there exists a factor r approximation for log Q. We argue that it can be used as a black 
box to obtain an r-approximation for Q. Given an instance (P, e) of Q, we construct the following instance 
of log Q: We take the set of points P', where P' contains a point p' for every point p G P whose coordinates 
are the logarithms of the corresponding coordinates of p. We also take c = log(l + e). That is, we ask for 
the smallest additive c-Pareto set of P'. If p', q' £ P' are the images of p, q G P respectively, we have that 
lZT>(p, q) = 2- AT '( p ' q > . Hence, there exists a bijection between e-Pareto sets of P and additive c-Pareto 
sets of P' , i.e. this simple transformation is an approximation factor preserving reduction of Q to log Q. 
There is however a subtle point regarding the bit complexity of the produced instance: the coordinates of the 
points in P' (and the desired additive coverage c) may be irrational, thus not computable exactly. We argue 
that this is not a significant problem below. 

Consider an instance (P', c) of log Q. (The following remarks also hold for Q and the dual problems.) 
Clearly, the feasible solutions to the problem, i.e. the (additive) c-Pareto sets of P' , do not depend on the 
actual coordinates of the points in P', but only on the additive distance between every pair of points. Hence, 
the only information an (exact or approximate) algorithm for log Q needs to know about the input instance 
is the set of pairwise distances. In fact, such an algorithm does not need an explicit representation of these 
distances as rational numbers. It is sufficient to have a succinct representation that allows: (i) efficiently 
computing a succinct representation of the sum of two (or more) distances (ii) efficiently comparing any 
two (sums of) distances and (iii) efficiently comparing (sums of) distances with c. Now the aforementioned 
transformation produces instances (P', c) of problem log Q that clearly satisfy these properties (since we 
have an explicit representation of the starting instance (P, e) of Q and we take logarithms). Hence, an r- 
approximation algorithm for log Q can be used as a black box to obtain an r-approximation for Q. Similar 
arguments may be used for the other direction. 

For the dual problem, the choice of coverage (multiplicative versus additive) changes the objective 
function, which affects the approximability. Roughly speaking, a factor r-approximation algorithm for log V 
is "equivalent" to a (/?* ^-approximation algorithm for V, where p* is the value of the optimal ratio for the 
latter problem. For example, it is easy to see (by taking logarithms as above) that a factor r approximation 
for logV implies a (p*)' r -approximation for V. 

We have the following: 

Theorem 4.9. Consider the problem T>(P, k)for d = 3 objectives. 

1. It is NP -hard to approximate the minimum ratio p* within any polynomial multiplicative factor. 

2. It is NP-hard to compute k points that approximate the Pareto curve with ratio better than (p*) 3 / 2 . 

Proof. To prove both parts we take advantage of the properties in the NP-hardness reduction of I1KPI . It 
is shown there that problem log Q is NP-hard for d = 3 via a reduction from 3-SAT. Given an instance of 
3-SAT, the reduction produces an instance (P, c) of log Q such that the smallest additive c-Pareto set of P 
reveals whether the 3-SAT formula is satisfiable. We will not repeat the reduction here, but we will just give 
the properties of the construction below needed for our purposes. We prove each part separately. 

1. The crucial property we need here is that the reduction in HKPl is strongly polynomial: Given an 
instance (formula) ip of 3-SAT with n-clauses, the reduction constructs an instance of log Q (or logV), 
consisting of a set P of points in 3 dimensions and an additive error bound c such that, if the formula 
ip is satisfiable then P has a (additive) c-cover with g points (for some parameter g of the construction), 
whereas if ip is not satisfiable then every c-cover must contain at least g + 1 points. The construction has the 
property that all the points of P have rational coordinates with 0(log n) bits and the error bound c ~ 1/n 2 
(to be precise, c = l/4n 2 ). This property implies that in the (additive) dual problem logV with a bound 
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k = g for the number of points in the cover, the additive "gap" in the value of the optimal covering distance 
between the Yes case (satisfiable 3-SAT instance ip) and the No case (non-satisfiable 3-SAT instance) is at 
least inverse polynomial in n, i.e. at least 5 = l/n r , for a (small) constant r: If the 3-SAT instance ip is 
satisfiable, the optimal value of the covering distance for the log V instance P with k = g is c; if p is not 
satisfiable, the optimal distance is at least d = c + 5. By multiplying all the coordinates of the constructed 
instance by a factor of 2n r+l , where / > is a constant, and rounding to the nearest integer, we get a new 
instance of log V where all the points have integer coordinates and the value of the additive gap between 
the satisfiable and the unsatisfiable case is at least n l . We then exponentiate each coordinate (x — ► 2 X ). The 
number of bits remains polynomial in the size of the original 3-SAT instance (thus the overall reduction 
takes polynomial time) and the value of the multiplicative gap is now 2 n . 

2. To prove this part, it suffices to show that problem log V does not have an approximation ratio better 
than 3/2. The reduction in BKPI1 uses a number g of gadgets. The construction has gadgets for the variables 
and for the clauses, which are connected by paths of flip-flop gadgets that cross using crossover gadgets. 
If the formula is satisfiable, then we can cover the points with additive distance c with g points, one from 
each gadget. Otherwise, this is not possible. We thus select k = g and ask for the "best k points" and 
the corresponding optimal covering distance c*. As previously mentioned, if the formula is satisfiable, we 
have c* = c. Now, if the formula is not satisfiable, we argue below that the optimal covering distance is 
c* > 3c/2. The proof follows directly from this. 

Suppose that the 3-SAT formula is not satisfiable and we want to select the best g points. First, we note 
that we still need one point from each gadget because otherwise all the points of a gadget must be covered 
by points in other gadgets that are "far away" (much further than c), since the gadgets are well-separated; 
that is, if some gadget contains no point of the solution then the covering distance is much larger than c. 
Since the formula is not satisfiable, after selecting g points, at least one gadget will remain "badly covered", 
i.e. the point we selected must cover more points of its gadget than its c-neighborhood. An examination 
of the three types of gadgets used in the construction shows that this gives covering distance 2c for both 
the flip-flop and clause gadgets and at least 3c/2 for the crossover gadgets. Hence, if the formula is not 
satisfiable, the optimal covering distance is c* > 3c/2. ■ 

4.2.2 Upper Bound 

Consider the following generalization Q'(A, P, 1 + e) of problem Q: Given a set of n points P C R^_, a 
subset A C P and e > 0, compute the smallest subset P*(A) C P that (1 + e)-covers A. It is easy to see 
that for d = 3 the arguments of HKPIIPRJ1 for Q can be applied to Q' as well showing that it admits a constant 
factor approximation (see Remark |4~8T ). We believe that in fact for all fixed d there may well be a constant 
factor approximation. Proving (or disproving) this for d > 3 seems quite challenging. The following weaker 
statement seems more manageable: 

Conjecture 4.10. For any fixed d, there exists a polynomial time ((1 + e) a ^ , /3(d))-bicriterion approxima- 
tion algorithm for Q'(A, P, 1 + e), i.e. an algorithm that outputs an (1 + e) a ^ -cover C C P of A, satisfying 
\C\ < (3(d) ■ \P*(A)\, for some functions a,P : IN -»■ M. 

For d = 3, Conjecture 14. 101 holds with a(3) < 2, and (3(3) < 4. This can be shown by a technical 
adaptation of the 3-objectives algorithm in IIVY1 . 

For general implicitly represented multiobjective problems with a polynomial GAP5 routine, we for- 
mulate the following conjecture: 

Conjecture 4.11. For any fixed d, there exists a polynomial time generic algorithm, that outputs an (1 + 
e ) a ( d ) -cover C, whose cardinality is \C\ < (3(d) ■ OPT ' e , for some functions n . i : IX • IX. 
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The case of d = 3 is proved in [VY] with a(3) = any constant greater than 2 and /3(3) = 4. Note that, 
by (a variant of) Lemma 1431 Conjecture 14. 101 implies Conjecture 14. Ill The converse is also partially true: 
Conjecture 14.111 implies Conjecture 14.101 if in the statement of the latter, problem Q! is substituted with 
problem Q. 

In the following theorem, we show that a constant factor bicriterion approximation for Q! implies a 
constant power approximation for the dual problem, given the GAP routine. 

Theorem 4.12. Consider a (implicitly represented) d-objective problem in MPTAS and suppose that the 
minimum achievable ratio with k points is p*. 

1. For d = 3 objectives we can compute k points which approximate the Pareto set with ratio 0((p*) 9 ), 
using 0{{m/5) d ) GAP 5 calls, where 1/5 = 0(l/(e' - e)). 

2. If Conjecture \4. 10\ holds. then for any fixed d we can compute k points which approximate the Pareto set 
with ratio 0((p*) c ), using 0((m/5) d ) GAP 8 calls, where 1/5 = 0(1/ (e 7 - e)) and c = c(d). 

Proof. Part 1 follows from 2 since Conjecture 14. 101 holds for d = 3. (It will follow from the proof that 
c(3) < 9.) To show Part 2, we exploit the relation of problem V(P, k) with the asymmetric /c-center 
problem. As observed in BVYl . the problem log V is an instance of the asymmetric /c-center problem, which 
we now define for the sake of completeness. In the asymmetric fc-center problem we are given a set of 
n vertices V with distances, dist(u, v) that must satisfy the triangle inequality, but may be asymmetric, 
i.e. dist(«, v) ^ dist(f,u). We are asked to find a subset U C V, \U\ = k, that minimizes dist* = 
maxmindist(u, v). (Note that log'D(P, k) is an instance of this problem, where there exists a bijection 

between vertices of V and points of P and the distance between points (vertices) p, q G P is defined as 

d(p,q) =AV(p,q).) 

We claim that, if problem Q'(A, P, 1 + e) admits a ((1 + e) a ^, /3(d))-bicriterion approximation, then 
problem T>(P, k) admits a (p*) c ^ approximation for some function c (that depends on a and (3). This is 
implied by the aforementioned reduction and the following more general fact: If we have an instance of the 
asymmetric £>center problem (problem log V(P, k) in our setting) such that a certain collection of associated 
set cover subproblems (which are instances of problem log Q'(A, P, 1 + e) here) admits a constant factor 
bicriterion approximation (an algorithm that blows up both criteria by a constant factor), then this instance 
admits a constant factor unicriterion approximation (an algorithm that outputs a set of no more than k 
centers). This implication is not stated in HP VI I Aril , but is implicit in their work. One way to prove it is 
to apply Lemma 5 of MPVII in a recursive manner. We will describe an alternative method HAr21 that yields 
better constants. We prove this implication, appropriately translated to our setting, in Lemma 14.141 

For a general multiobjective problem where the solution points are not given explicitly, we impose a 
geometric y/1 + 5 grid for a suitable 5, call GAP<j at the grid points, and then apply the above algorithm to 
the set of points returned. Then the set of k points computed by the algorithm provide a (1 + e') c ^ -cover 
of the Pareto curve, where 1 + e' = (1 + e)(l + 5) 2 . ■ 

Remark 4.13. Even though the 0(log* /^-approximation ratio is best possible for the (general) asymmetric 
/c-center problem MCG+l . the corresponding hardness result does not apply for log V as long as the dimension 
d is fixed. 

Let H(a) denote the harmonic number extended to fractional arguments by linear interpolation (i.e. 

H( a ) = Yl\=i 1/i + {oc — L Q J)/[ a l)- F° r a function g, let <jrM denote the function iterated i times. 
Finally, for b > 1 define H£(a) = mm{i : H^\a) < b}. The following lemma completes the proof of 
Theorem 14. 121 

Lemma 4.14. Suppose that there exists an ((1 + e) a , f3)-bicriterion approximation for Q'(A,P, 1 + e). 
Then, problem T>(P, k) admits a (p*) c approximation, where c = H^^iP) +ot + A. In particular, for a = 2 
and (5 = 4, we can get c = 9. 
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Proof. The desired result can be shown by a careful application of the techniques introduced in BPVIIArlt . 
We describe an algorithm - that we denote V(P, k), as the corresponding problem - which, given a (p a ,f3)- 
bicriterion approximation algorithm, denoted B{A, P, p), for problem Q'(A, P, p) as a black box, computes 
a set Q C P of (at most) k points that (p*) c -cover the set P, where p* is the minimum ratio achievable with 
k points. We will denote by B(A, P, p) the set of points output by the algorithm B on input (A, P, p). 

We first note the simple (and well-known) fact that it is no loss of generality to assume that the algorithm 
V(P, k) "knows" the optimal ratio p*; this is because p* will be one of the 0(|P| 2 ) pairwise ratio distances, 
hence we can try the algorithm for all of them and pick the best solution (or do an appropriate binary search, 
see e.g. E3)). 

To describe the algorithm, we appropriately translate the notions from BPVI I Aril to the current setting. 
For the sake of completeness, we also provide a mostly complete proof of correctness. 

We begin with a basic definition. 

Definition 4.15. For a point q G P and a parameter p > 1, we denote T + (q, p) = {p G P | q < p ■ p} the 
set of points in P p-covered by q and T~(q, p) = {p G P \ p < p • q} the set of points in P that p-cover q. 
We naturally extend this notation to sets SCP: r ± (S, p) = {p G P \ p G r ± (s, p) for some s G S}. We 
say that the point q G P is a p-center capturing vertex (denoted p-CCV) if it satisfies T~(q, p) C T + (q, p). 

Consider an instance of the problem T>(P, k) as defined above. Suppose that p > p*. In this case, if 
the point q is a p-CCV, it p-covers at least one point of the optimal solution - in particular, the point q* 
that p*-covers q. Indeed, q* G T~(q,p*) C T~(q,p) C T + (q, p). Hence, q p 2 -covers every point in P 
p-covered by the point q*. This simple property is crucial for the algorithm. 

The algorithm in IIP VII has two phases. In the first phase, roughly, it preprocesses the input set by 
iteratively finding CCV's and in the second phase it uses a recursive set cover procedure to cover the points 
not covered in the first stage. (The algorithm in BArlll replaces the second phase by an LP-based method.) 

The algorithm V{P, k) works in three phases. The first phase is identical to the first phase in IIP VII Aril : 
We preprocess the input set P by iteratively finding p*-CCVs. In the second phase, V calls the bicriterion 
approximation algorithm B (with appropriately selected values of its parameters) to cover the subset of P 
that is not covered in the first phase. The remaining phase involves a careful application of the recursive 
greedy set cover procedure of BPVI followed by an application of the greedy set cover algorithm. To show 
correctness of the last step, we use the structural lemma of BArlll (itself a variant of a similar lemma in BPVI . 
albeit with improved constants). 

The algorithm is presented in detail below: 

We now proceed with an intuitive explanation of the different steps in tandem with a proof of correctness. 
We explain first what happens during the first phase. We have as input the set P, the parameter k and the 
optimal ratio p*. (Recall that the algorithm can "guess" the optimal ratio.) We iteratively select p*-CCV's 
as follows: For each p*-CCV we find, we remove from the "active" set A (initialized to P) all the points 
(p*) 2 -covered by it, until no more CCV's exist in A. Let C be the set of CCV's thus discovered (|C| < k) 
and A = P \ T + (C, (p*) 2 ) be the set of points in P not (p*) 2 -covered by any point in C. At this point, we 
note the following simple fact: 

Fact 4.16. The set A:=P\ T + (C, (p*) 2 ) can be p* -covered by k' = k - \C\ points in P \ T + (C, p*). 

If \C\ = k (k' = 0, A = 0), we have selected a set of k points that (p*) 2 -cover the set P and we can 
just terminate the algorithm. Otherwise, we proceed with the next phase. In the second phase, we call the 
algorithm B to p* -cover the set A. By Fact 14.161 there exists a p* -cover of A with kl points. Moreover, it 
is clear that such a cover lies in P \ T + (C, p*). Hence, we get a set So C P \ T + (C, p*) of cardinality 
I So | < (3 • k' that (p*) a -covers A. To motivate the next step, we note the following immediate implication 
of Fact OBI 
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Algorithm V(P, k) 

(The optimal radius p* is known to the algorithm.) 
{Phase 1) 

A = P;k' = k;C = 0; 

While there exists a p*-CCV q £ A and k' > do 

{C = Cu{q}; 

A = A\T+(q, (p*) 2 ); 
k' = k' - 1; } 

(Phase 2 ) 

S =B(A,P\T+(C,/?),p*y, 
(Phase 3) 

50 = S \r + (C, (p*) 2 ); 

51 = Rec-Cover (5 , A, P, p*, k')\ 

51 = Si\T+{C, (p*) 4 ); 

5 2 = Greedy-Set-Cover (Si, P, (p*) 3 ); 

Return Q:=CU S 2 . 



Routine Rec-Cover (Input: S, A, P, p, I) 
(There exist I vertices in P that p-cover S, 
where 5CiC P.) 

S° = S; i = 0; 
While 15*1 > 4Z/3 do 
{ 

Run Greedy Set Cover to p-cover S l using points 
of P and let S' l+1 C P be the produced set. 

i = i + 1; 

} 

Return S\ 



Fact 4.17. Le? 5 C A, Then S can be p* -covered by k' points in P \ T + (C, p*). 

We also recall the following well-known fact MChvl Uohl ILovl about the performance guarantee of the 
greedy set cover algorithm: 

Fact 4.18. For a set system (U,1Z) suppose that there exists a set cover of cardinality p. Then the greedy 
algorithm outputs a cover of size at most p ■ H(\U\/p). 

At this point, we apply the recursive greedy set cover procedure from HP VII to cover Sq = Sq n A using 
points from A. (The points in Sq \ Sq are (p*) 2 -covered by C.) Note that in each round of the recursive 
cover, we attempt to cover only those points from the last round that do not lie in r + (C, (p*) 2 ), since C will 
cover those ones. We thus get a set S\ C P of cardinality \S\\ < 4fe'/3 with the property that Si covers 
S \ r+(C, (p*) 1+H ±/z {!3) ) with ratio (p*) H */^® . The latter statement can be shown by induction, using 
Fact 14.171 as an invariant. Since this essentially appears in UPVIIArll (see e.g. Lemma 13 in HArllO . we do 
not repeat it here. To motivate the next step, we need the following combinatorial lemma from MArlll : 

Lemma 4.19 (Theorem 17 in llArTl . rephrased). Let C C P and A = P \ T + (C, (p*) 2 ). Suppose A has 
no p*-CCV's and that there exist k' centers (points in P) that p*-cover A. Then there exists a set of2k'/3 
centers in P \ T+(C, p*) that (p*f -covers A' = P\ T + (C, (p*) 4 ). 

As a final step of the algorithm, we apply the greedy set cover algorithm - that may be viewed as one 
iteration of the recursive procedure - with parameter (p*) 3 to cover Si = Si \ T + (C, (p*) 4 ) using points 
from P (so that the optimum has cardinality at most 2fc'/3, according to Lemma I4.19I ). (Note that the 
points in Si \ Si are (p*) 4 -covered by C.) We thus get a set S 2 C P of cardinality at most (2A//3) • 
H ((4k'/ '3) I (2k' I '3)) = (2k! j 3) ■ H(2) = k! with the property that S 2 covers Si within (p*) 3 . We output 
the set Q := C U S 2 ; this set has cardinality at most k and it remains to argue that it covers P with ratio 
(p*f +a+H l/^\ Indeed, every point p e P falls in one of the following categories: 

• The point p is (p*) 2 -covered by a point in C, i.e. p G r + (C, (p*) 2 )- (Note that if this is not the case, 
i.e. if p E A, then it is (p*) a -covered by Sq.) 
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• The point p is (p*) a -covered by a point po G So that is not (p*) Hi / 3 ^' - covered by Si. In this case, 
po G r+(C, (p*) 1+ ^*/3 (/3) ), so C covers p within ratio ( p *) 1+H m^ +a . 

• The point p is (p*) a -covered by a point po G So that is (p*)^ 4 / 3 ^ - covered by a point pi G Si that is 
not (p*) 3 -covered by S 2 . In this case, p 1 G T+(C, (p*) 4 ), so C covers p within ratio ( y0 *) 4+H 4/ 3 (/ 3 )+ Q _ 

• The point p is (p*)" -covered by a point po £ 5*0 that is (p*)^ 4 / 3 ^-* - covered by a point p\ G Si that 
is in turn (p*) 3 -covered by p 2 G S2. In this case, the point p 2 covers p within ratio (p*) 3+ ^ 4 /3^) +a . 

Hence the overall covering ratio is (p*) 4+ ^ 4 /3^) +a > which completes the proof. ■ 



Remark 4.20. We note here that the recursive set cover procedure (used in the above lemma) was useful 
merely to improve the constants in the reduction. One can alternatively prove a (quantitatively inferior) 
version of the lemma by the following two-phase algorithm: In the first phase, preprocess the input set P by 
iteratively finding p-CCVs for appropriately chosen values of the parameter p. In the second phase, call the 
algorithm B to "cover" the subset of P that is not covered in the first phase. The analysis of this alternative 
algorithm is based on repeated applications of Lemma [4.19i 



Remark 4.21. We should remark that the algorithms of this section are less satisfactory that the bi-objective 
algorithm of the previous section (and the 2-d and 3-d algorithms of HVYII ) in several respects. One weakness 
is that the constants c obtained (for d = 3) are quite large: in the case of Theorem 14.11 the best constant c 
we can get follows from the net construction of BPRII (and is about 25). In the case of Theorem 14 . 1 2 1 there is 
still a large gap between the upper bound (of 9) and the lower bound (of 3/2) in the exponent. 

A second weakness of the algorithms is that they start by applying the general method of MPYll calling 
the GAP routine on a grid, and thus incur always the worst-case time complexity even if there is a very small 
e-Pareto set. Thus, we view our algorithms in this section mainly as theoretical proofs of principle, i.e. that 
certain (constant) approximations can be computed in polynomial time, but it would be very desirable and 
important to improve both the constants and the time. 



5 Conclusion 

We investigated the problem of computing a minimum set of solutions for a multiobjective optimization 
problem that represents approximately the whole Pareto curve within a desired accuracy e. We developed 
tight approximation algorithms for the bi-objective shortest path problem, spanning tree, and a host of other 
bi-objective problems. Our algorithms compute efficiently an approximate Pareto set that contains at most 
twice as many solutions as the minimum one; furthermore improving on the factor 2 for these specific 
problems is NP-Hard. The algorithm works in general for all bi-objective problems for which we have a 
routine for the Restricted problem of approximating one objective subject to a (hard) bound on the other. 
The algorithm calls this Restricted routine and a dual one as black boxes and makes quite effective use of 
them: for every instance, the number of calls is linear (at most 4 times) in the number of points in the optimal 
solution for that instance. 

We presented also results for three and more objectives, both for the problem of computing an optimal 
e-Pareto set and for the dual problem of selecting a specified number k of points that provide the best 
approximation of the full Pareto curve. As we indicated at the end of the last section, there is still a lot of 
room for improvement both in the time complexity and the constants of the approximations achieved. We 
would like especially to resolve Conjecture 14.111 hopefully positively. It would be great to have a general 
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efficient method for any (small) fixed number d of objectives that computes for every instance a succinct 
approximate Pareto set with small constant loss in accuracy and in the number of points, and do it in time 
proportional to the number of computed points, i.e., the optimal approximate Pareto set for the instance in 
hand. 

Acknowledgements. We would like to thank Aaron Archer for useful discussions related to the asymmetric 
/c-center problem. 
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